Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 1048575 |
| Missing cells | 30060590 |
| Missing cells (%) | 86.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 264.0 MiB |
| Average record size in memory | 264.0 B |
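The two derived figures in the table above follow from the raw counts. As a minimal sketch (the helper names `missing_cell_pct` and `avg_record_bytes` are illustrative and not part of ydata-profiling), they can be re-derived from the reported totals:

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of empty cells, as a percentage of all rows x columns."""
    return 100.0 * missing_cells / (n_rows * n_cols)


def avg_record_bytes(total_mib: float, n_rows: int) -> float:
    """Average bytes per row, given the total size in MiB (1 MiB = 1024**2 B)."""
    return total_mib * 1024**2 / n_rows


# Figures copied from the table above.
pct = missing_cell_pct(30_060_590, 1_048_575, 33)
rec = avg_record_bytes(264.0, 1_048_575)
print(f"{pct:.1f}% missing, {rec:.1f} B per record")
```

Both values round to the figures shown in the report (86.9% and 264.0 B).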
Variable types
| Numeric | 1 |
|---|---|
| Categorical | 6 |
| Text | 11 |
| Unsupported | 15 |
| Alert | Type |
|---|---|
| Carrier has constant value "Airtel" | Constant |
| Unnamed: 26 has constant value "0.9" | Constant |
| Unnamed: 27 has constant value "0.0" | Constant |
| Unnamed: 28 has constant value "user" | Constant |
| Unnamed: 32 has constant value "0.0" | Constant |
| Number is highly overall correlated with Unnamed: 31 | High correlation |
| Unnamed: 12 is highly overall correlated with Unnamed: 31 | High correlation |
| Unnamed: 31 is highly overall correlated with Number and 1 other fields | High correlation |
| Unnamed: 12 is highly imbalanced (70.1%) | Imbalance |
| Name has 57060 (5.4%) missing values | Missing |
| Gender has 1006524 (96.0%) missing values | Missing |
| JobTitle has 1017036 (97.0%) missing values | Missing |
| CompanyName has 1033131 (98.5%) missing values | Missing |
| Email has 782691 (74.6%) missing values | Missing |
| Facebook has 1029196 (98.2%) missing values | Missing |
| Twitter has 1040714 (99.3%) missing values | Missing |
| Unnamed: 10 has 1035163 (98.7%) missing values | Missing |
| Unnamed: 11 has 1041161 (99.3%) missing values | Missing |
| Unnamed: 12 has 1045043 (99.7%) missing values | Missing |
| Unnamed: 13 has 1047413 (99.9%) missing values | Missing |
| Unnamed: 14 has 1048162 (> 99.9%) missing values | Missing |
| Unnamed: 15 has 1048457 (> 99.9%) missing values | Missing |
| Unnamed: 16 has 1048531 (> 99.9%) missing values | Missing |
| Unnamed: 17 has 1048555 (> 99.9%) missing values | Missing |
| Unnamed: 18 has 1048561 (> 99.9%) missing values | Missing |
| Unnamed: 19 has 1048564 (> 99.9%) missing values | Missing |
| Unnamed: 20 has 1048568 (> 99.9%) missing values | Missing |
| Unnamed: 21 has 1048569 (> 99.9%) missing values | Missing |
| Unnamed: 22 has 1048571 (> 99.9%) missing values | Missing |
| Unnamed: 23 has 1048571 (> 99.9%) missing values | Missing |
| Unnamed: 24 has 1048571 (> 99.9%) missing values | Missing |
| Unnamed: 25 has 1048572 (> 99.9%) missing values | Missing |
| Unnamed: 26 has 1048574 (> 99.9%) missing values | Missing |
| Unnamed: 27 has 1048574 (> 99.9%) missing values | Missing |
| Unnamed: 28 has 1048574 (> 99.9%) missing values | Missing |
| Unnamed: 29 has 1048573 (> 99.9%) missing values | Missing |
| Unnamed: 30 has 1048572 (> 99.9%) missing values | Missing |
| Unnamed: 31 has 1048573 (> 99.9%) missing values | Missing |
| Unnamed: 32 has 1048574 (> 99.9%) missing values | Missing |
| Unnamed: 31 is uniformly distributed | Uniform |
| Number has unique values | Unique |
| Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 29 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| Unnamed: 30 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
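pandas assigns `Unnamed: N` names to columns whose header cell is empty, so the 23 nearly empty `Unnamed` columns may come from a small number of rows that carry more fields than the header. That is an assumption about the source file, not something the report states. A minimal sketch for checking it with the standard library follows; `field_count_histogram` and the sample rows are invented for illustration:

```python
import csv
import io
from collections import Counter


def field_count_histogram(lines, delimiter=","):
    """Count how many rows have each number of fields."""
    return Counter(len(row) for row in csv.reader(lines, delimiter=delimiter))


# Invented sample: a 3-column header, one clean row, one row with 2 extra fields.
sample = io.StringIO(
    "Number,Carrier,Name\n"
    "n1,Airtel,name one\n"
    "n2,Airtel,name two,extra a,extra b\n"
)
hist = field_count_histogram(sample)
print(sorted(hist.items()))
```

Rows whose field count exceeds the header width would be the candidates for cleaning before the file is profiled again.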
Reproduction
| Analysis started | 2024-07-18 08:51:44.852370 |
|---|---|
| Analysis finished | 2024-07-18 08:52:46.006929 |
| Duration | 1 minute and 1.15 seconds |
| Software version | ydata-profiling v4.9.0 |
| Download configuration | config.json |
Number
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 1048575 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1758429 × 10¹¹ |
| Minimum | 9.17032 × 10¹¹ |
|---|---|
| Maximum | 9.1799549 × 10¹¹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 MiB |
Quantile statistics
| Minimum | 9.17032 × 10¹¹ |
|---|---|
| 5-th percentile | 9.1703227 × 10¹¹ |
| Q1 | 9.1709341 × 10¹¹ |
| median | 9.1770267 × 10¹¹ |
| Q3 | 9.1789389 × 10¹¹ |
| 95-th percentile | 9.1799518 × 10¹¹ |
| Maximum | 9.1799549 × 10¹¹ |
| Range | 9.6348589 × 10⁸ |
| Interquartile range (IQR) | 8.0048319 × 10⁸ |
Descriptive statistics
| Standard deviation | 3.9533173 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 0.00043083968 |
| Kurtosis | -1.5734815 |
| Mean | 9.1758429 × 10¹¹ |
| Median Absolute Deviation (MAD) | 2.908676 × 10⁸ |
| Skewness | -0.45752983 |
| Sum | 9.6215595 × 10¹⁷ |
| Variance | 1.5628717 × 10¹⁷ |
| Monotonicity | Not monotonic |
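Several of the Number statistics are simple functions of the others: range = maximum − minimum, IQR = Q3 − Q1, CV = standard deviation / mean, and variance = standard deviation². The sketch below re-derives them from the displayed (rounded) values, so small differences in the last digits are expected:

```python
# Figures copied from the Number tables above (rounded as displayed).
mean, std = 9.1758429e11, 3.9533173e8
minimum, maximum = 9.17032e11, 9.1799549e11
q1, q3 = 9.1709341e11, 9.1789389e11

value_range = maximum - minimum  # reported: 9.6348589e8
iqr = q3 - q1                    # reported: 8.0048319e8
cv = std / mean                  # reported: 0.00043083968
variance = std ** 2              # reported: 1.5628717e17

print(f"range={value_range:.4e} iqr={iqr:.4e} cv={cv:.6e} var={variance:.6e}")
```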
| Value | Count | Frequency (%) |
| 9.170320009 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| 9.178934804 × 10¹¹ | 1 | < 0.1% |
| Other values (1048565) | 1048565 |
| Value | Count | Frequency (%) |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 | |
| 9.170320001 × 10¹¹ | 1 |
| Value | Count | Frequency (%) |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.17995486 × 10¹¹ | 1 | |
| 9.179954859 × 10¹¹ | 1 | |
| 9.179954859 × 10¹¹ | 1 | |
| 9.179954859 × 10¹¹ | 1 |
Carrier
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 MiB |
| Airtel |
|---|
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 6291450 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Airtel |
|---|---|
| 2nd row | Airtel |
| 3rd row | Airtel |
| 4th row | Airtel |
| 5th row | Airtel |
Common Values
| Value | Count | Frequency (%) |
| Airtel | 1048575 |
Common Values (Plot)
| Value | Count | Frequency (%) |
| airtel | 1048575 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1048575 | |
| i | 1048575 | |
| r | 1048575 | |
| t | 1048575 | |
| e | 1048575 | |
| l | 1048575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5242875 | |
| Uppercase Letter | 1048575 | 16.7% |
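The category labels in this report ("Uppercase Letter", "Lowercase Letter", and so on) match the names of Unicode general categories (`Lu`, `Ll`, ...). As a rough sketch of how the counts above can be reproduced, assuming every one of the 1048575 rows holds the constant value "Airtel" (the helper `category_counts` is illustrative, not the profiler's own code):

```python
import unicodedata
from collections import Counter


def category_counts(value: str) -> Counter:
    """Count characters of one string by Unicode general category."""
    return Counter(unicodedata.category(ch) for ch in value)


n_rows = 1_048_575
per_value = category_counts("Airtel")  # one uppercase, five lowercase letters
totals = {cat: n * n_rows for cat, n in per_value.items()}
upper_share = 100.0 * totals["Lu"] / sum(totals.values())
print(totals, f"{upper_share:.1f}%")
```

The totals agree with the table: 1048575 uppercase letters (16.7%) and 5242875 lowercase letters.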
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1048575 | |
| r | 1048575 | |
| t | 1048575 | |
| e | 1048575 | |
| l | 1048575 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1048575 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6291450 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1048575 | |
| i | 1048575 | |
| r | 1048575 | |
| t | 1048575 | |
| e | 1048575 | |
| l | 1048575 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6291450 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1048575 | |
| i | 1048575 | |
| r | 1048575 | |
| t | 1048575 | |
| e | 1048575 | |
| l | 1048575 |
Name
Text
MISSING 
| Distinct | 684244 |
|---|---|
| Distinct (%) | 69.0% |
| Missing | 57060 |
| Missing (%) | 5.4% |
| Memory size | 8.0 MiB |
Length
| Max length | 277 |
|---|---|
| Median length | 105 |
| Mean length | 11.963313 |
| Min length | 1 |
Characters and Unicode
| Total characters | 11861804 |
|---|---|
| Distinct characters | 195 |
| Distinct categories | 17 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 629316 |
|---|---|
| Unique (%) | 63.5% |
Sample
| 1st row | Raj Kumar. Mncl |
|---|---|
| 2nd row | Deva |
| 3rd row | Lakshay |
| 4th row | Prathap |
| 5th row | Shekar Chinnu |
| Value | Count | Frequency (%) |
| reddy | 25690 | 1.3% |
| kumar | 25320 | 1.3% |
| k | 18258 | 0.9% |
| sai | 16598 | 0.8% |
| 2 | 15981 | 0.8% |
| m | 14910 | 0.7% |
| s | 14868 | 0.7% |
| raju | 13802 | 0.7% |
| p | 13567 | 0.7% |
| b | 11636 | 0.6% |
| Other values (238013) | 1849843 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1908336 | |
| (space) | 1029514 | 8.7% |
| i | 733298 | 6.2% |
| n | 639341 | 5.4% |
| r | 604993 | 5.1% |
| h | 604453 | 5.1% |
| e | 514653 | 4.3% |
| u | 455189 | 3.8% |
| d | 357690 | 3.0% |
| s | 337274 | 2.8% |
| Other values (185) | 4677063 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8574922 | |
| Uppercase Letter | 1980919 | 16.7% |
| Space Separator | 1030181 | 8.7% |
| Decimal Number | 138750 | 1.2% |
| Other Punctuation | 111603 | 0.9% |
| Currency Symbol | 5523 | < 0.1% |
| Dash Punctuation | 5362 | < 0.1% |
| Other Symbol | 3394 | < 0.1% |
| Math Symbol | 2527 | < 0.1% |
| Open Punctuation | 1985 | < 0.1% |
| Other values (7) | 6638 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 319575 | |
| R | 188109 | 9.5% |
| M | 159359 | 8.0% |
| K | 154666 | 7.8% |
| A | 150492 | 7.6% |
| P | 135528 | 6.8% |
| B | 111200 | 5.6% |
| N | 106612 | 5.4% |
| V | 95366 | 4.8% |
| C | 75403 | 3.8% |
| Other values (57) | 484609 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1908336 | |
| i | 733298 | 8.6% |
| n | 639341 | 7.5% |
| r | 604993 | 7.1% |
| h | 604453 | 7.0% |
| e | 514653 | 6.0% |
| u | 455189 | 5.3% |
| d | 357690 | 4.2% |
| s | 337274 | 3.9% |
| m | 319181 | 3.7% |
| Other values (45) | 2100514 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 96377 | |
| @ | 2947 | 2.6% |
| ' | 2184 | 2.0% |
| * | 1687 | 1.5% |
| ? | 1163 | 1.0% |
| § | 924 | 0.8% |
| & | 845 | 0.8% |
| # | 803 | 0.7% |
| … | 680 | 0.6% |
| ! | 642 | 0.6% |
| Other values (11) | 3351 | 3.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 34009 | |
| 1 | 23029 | |
| 3 | 14742 | |
| 9 | 12031 | 8.7% |
| 7 | 11960 | 8.6% |
| 0 | 11880 | 8.6% |
| 4 | 9304 | 6.7% |
| 5 | 7774 | 5.6% |
| 8 | 7284 | 5.2% |
| 6 | 6737 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 1492 | |
| ¬ | 696 | |
| | | 136 | 5.4% |
| + | 87 | 3.4% |
| ~ | 79 | 3.1% |
| < | 14 | 0.6% |
| = | 13 | 0.5% |
| > | 10 | 0.4% |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 1024 | |
| ¦ | 741 | |
| ™ | 713 | |
| № | 578 | |
| © | 338 | 10.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ‚ | 759 | |
| ( | 630 | |
| „ | 532 | |
| { | 43 | 2.2% |
| [ | 21 | 1.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 646 | |
| « | 443 | |
| “ | 282 | |
| ‹ | 259 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 494 | |
| ” | 309 | |
| » | 237 | |
| › | 236 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4990 | |
| — | 197 | 3.7% |
| – | 175 | 3.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 4100 | |
| $ | 913 | 16.5% |
| € | 510 | 9.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 721 | |
| } | 44 | 5.6% |
| ] | 20 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1029514 | |
| (non-ASCII space) | 667 | 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 62 | |
| ` | 24 | 27.9% |
Control
| Value | Count | Frequency (%) |
| (control character) | 1526 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 754 |
Format
| Value | Count | Frequency (%) |
| (format character) | 581 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10509756 | |
| Common | 1306248 | 11.0% |
| Cyrillic | 45800 | 0.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 1029514 | |
| . | 96377 | 7.4% |
| 2 | 34009 | 2.6% |
| 1 | 23029 | 1.8% |
| 3 | 14742 | 1.1% |
| 9 | 12031 | 0.9% |
| 7 | 11960 | 0.9% |
| 0 | 11880 | 0.9% |
| 4 | 9304 | 0.7% |
| 5 | 7774 | 0.6% |
| Other values (64) | 55628 | 4.3% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 8505 | |
| а | 5079 | 11.1% |
| џ | 2861 | 6.2% |
| р | 2273 | 5.0% |
| Ш | 2107 | 4.6% |
| в | 1901 | 4.2% |
| Е | 1682 | 3.7% |
| Щ | 1562 | 3.4% |
| Ґ | 1352 | 3.0% |
| ё | 1263 | 2.8% |
| Other values (59) | 17215 |
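The Cyrillic characters counted here (Г, р, џ, ...) in a column that is otherwise almost entirely Latin script resemble a common mojibake pattern, in which UTF-8 bytes were decoded as Windows-1251. That is an assumption about the source data, not something the report confirms. A minimal sketch of the round trip, with a hypothetical helper `repair_cp1251_mojibake`:

```python
def repair_cp1251_mojibake(text: str) -> str:
    """Try to undo UTF-8 text that was wrongly decoded as Windows-1251.

    Returns the input unchanged when the round trip is not possible.
    """
    try:
        return text.encode("cp1251").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


# Simulate the damage on a single accented letter, then repair it.
garbled = "é".encode("utf-8").decode("cp1251")
print(garbled)
print(repair_cp1251_mojibake(garbled))
print(repair_cp1251_mojibake("Raj Kumar"))  # plain ASCII passes through
```

Applying such a repair to real rows would need a check that the result is plausible, since genuinely Cyrillic text must be left alone.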
Latin
| Value | Count | Frequency (%) |
| a | 1908336 | |
| i | 733298 | 7.0% |
| n | 639341 | 6.1% |
| r | 604993 | 5.8% |
| h | 604453 | 5.8% |
| e | 514653 | 4.9% |
| u | 455189 | 4.3% |
| d | 357690 | 3.4% |
| s | 337274 | 3.2% |
| S | 319575 | 3.0% |
| Other values (42) | 4034954 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11794310 | |
| Cyrillic | 45800 | 0.4% |
| None | 13657 | 0.1% |
| Punctuation | 6236 | 0.1% |
| Letterlike Symbols | 1291 | < 0.1% |
| Currency Symbols | 510 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1908336 | |
| (space) | 1029514 | 8.7% |
| i | 733298 | 6.2% |
| n | 639341 | 5.4% |
| r | 604993 | 5.1% |
| h | 604453 | 5.1% |
| e | 514653 | 4.4% |
| u | 455189 | 3.9% |
| d | 357690 | 3.0% |
| s | 337274 | 2.9% |
| Other values (83) | 4609569 |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 8505 | |
| а | 5079 | 11.1% |
| џ | 2861 | 6.2% |
| р | 2273 | 5.0% |
| Ш | 2107 | 4.6% |
| в | 1901 | 4.2% |
| Е | 1682 | 3.7% |
| Щ | 1562 | 3.4% |
| Ґ | 1352 | 3.0% |
| ё | 1263 | 2.8% |
| Other values (59) | 17215 |
None
| Value | Count | Frequency (%) |
| ¤ | 4100 | |
| (control character) | 1526 | 11.2% |
| ± | 1492 | 10.9% |
| ® | 1024 | 7.5% |
| § | 924 | 6.8% |
| ¦ | 741 | 5.4% |
| ¬ | 696 | 5.1% |
| (non-ASCII space) | 667 | 4.9% |
| (format character) | 581 | 4.3% |
| « | 443 | 3.2% |
| Other values (5) | 1463 | 10.7% |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 759 | |
| … | 680 | |
| ‘ | 646 | |
| ‡ | 539 | |
| „ | 532 | |
| • | 524 | |
| ’ | 494 | |
| † | 478 | |
| ” | 309 | 5.0% |
| “ | 282 | 4.5% |
| Other values (5) | 993 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 713 | |
| № | 578 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 510 |
Gender
Text
MISSING 
| Distinct | 2005 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 1006524 |
| Missing (%) | 96.0% |
| Memory size | 8.0 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 4 |
| Mean length | 4.4158522 |
| Min length | 1 |
Characters and Unicode
| Total characters | 185691 |
|---|---|
| Distinct characters | 132 |
| Distinct categories | 15 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 1806 |
|---|---|
| Unique (%) | 4.3% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| 4th row | FEMALE |
| 5th row | MALE |
| Value | Count | Frequency (%) |
| male | 32674 | |
| female | 6082 | 14.2% |
| v | 153 | 0.4% |
| 2 | 139 | 0.3% |
| i | 75 | 0.2% |
| m | 75 | 0.2% |
| p | 72 | 0.2% |
| s | 70 | 0.2% |
| k | 62 | 0.1% |
| r | 57 | 0.1% |
| Other values (1983) | 3301 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 44863 | |
| M | 38985 | |
| A | 38901 | |
| L | 38812 | |
| F | 6126 | 3.3% |
| (space) | 2544 | 1.4% |
| a | 2362 | 1.3% |
| r | 1014 | 0.5% |
| n | 800 | 0.4% |
| i | 788 | 0.4% |
| Other values (122) | 10496 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 169701 | |
| Lowercase Letter | 12231 | 6.6% |
| Space Separator | 2547 | 1.4% |
| Decimal Number | 703 | 0.4% |
| Other Punctuation | 417 | 0.2% |
| Dash Punctuation | 27 | < 0.1% |
| Other Symbol | 17 | < 0.1% |
| Currency Symbol | 16 | < 0.1% |
| Initial Punctuation | 11 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Other values (5) | 15 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 44863 | |
| M | 38985 | |
| A | 38901 | |
| L | 38812 | |
| F | 6126 | 3.6% |
| S | 227 | 0.1% |
| V | 210 | 0.1% |
| R | 176 | 0.1% |
| B | 165 | 0.1% |
| P | 162 | 0.1% |
| Other values (34) | 1074 | 0.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2362 | |
| r | 1014 | 8.3% |
| n | 800 | 6.5% |
| i | 788 | 6.4% |
| e | 721 | 5.9% |
| d | 596 | 4.9% |
| s | 594 | 4.9% |
| u | 587 | 4.8% |
| h | 569 | 4.7% |
| l | 519 | 4.2% |
| Other values (29) | 3681 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 333 | |
| @ | 16 | 3.8% |
| & | 13 | 3.1% |
| ? | 10 | 2.4% |
| * | 10 | 2.4% |
| ' | 9 | 2.2% |
| : | 8 | 1.9% |
| ; | 4 | 1.0% |
| § | 3 | 0.7% |
| • | 3 | 0.7% |
| Other values (5) | 8 | 1.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 204 | |
| 1 | 119 | |
| 3 | 79 | 11.2% |
| 0 | 77 | 11.0% |
| 7 | 63 | 9.0% |
| 4 | 46 | 6.5% |
| 8 | 35 | 5.0% |
| 5 | 29 | 4.1% |
| 9 | 27 | 3.8% |
| 6 | 24 | 3.4% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 9 | |
| ™ | 4 | |
| ® | 2 | 11.8% |
| ¦ | 1 | 5.9% |
| № | 1 | 5.9% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 8 | |
| « | 2 | 18.2% |
| “ | 1 | 9.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 7 | |
| € | 5 | |
| $ | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| „ | 3 | |
| ( | 2 | |
| ‚ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2544 | |
| (non-ASCII space) | 3 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 24 | |
| – | 3 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| ¬ | 3 | |
| ± | 2 |
Control
| Value | Count | Frequency (%) |
| (control character) | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Format
| Value | Count | Frequency (%) |
| (format character) | 2 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 181766 | |
| Common | 3760 | 2.0% |
| Cyrillic | 165 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 44863 | |
| M | 38985 | |
| A | 38901 | |
| L | 38812 | |
| F | 6126 | 3.4% |
| a | 2362 | 1.3% |
| r | 1014 | 0.6% |
| n | 800 | 0.4% |
| i | 788 | 0.4% |
| e | 721 | 0.4% |
| Other values (42) | 8394 | 4.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 2544 | |
| . | 333 | 8.9% |
| 2 | 204 | 5.4% |
| 1 | 119 | 3.2% |
| 3 | 79 | 2.1% |
| 0 | 77 | 2.0% |
| 7 | 63 | 1.7% |
| 4 | 46 | 1.2% |
| 8 | 35 | 0.9% |
| 5 | 29 | 0.8% |
| Other values (40) | 231 | 6.1% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 19 | 11.5% |
| џ | 18 | 10.9% |
| р | 15 | 9.1% |
| в | 14 | 8.5% |
| а | 12 | 7.3% |
| Щ | 12 | 7.3% |
| Ђ | 9 | 5.5% |
| Ш | 8 | 4.8% |
| ў | 5 | 3.0% |
| Ќ | 5 | 3.0% |
| Other values (20) | 48 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 185449 | |
| Cyrillic | 165 | 0.1% |
| None | 41 | < 0.1% |
| Punctuation | 26 | < 0.1% |
| Currency Symbols | 5 | < 0.1% |
| Letterlike Symbols | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 44863 | |
| M | 38985 | |
| A | 38901 | |
| L | 38812 | |
| F | 6126 | 3.3% |
| (space) | 2544 | 1.4% |
| a | 2362 | 1.3% |
| r | 1014 | 0.5% |
| n | 800 | 0.4% |
| i | 788 | 0.4% |
| Other values (65) | 10254 | 5.5% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 19 | 11.5% |
| џ | 18 | 10.9% |
| р | 15 | 9.1% |
| в | 14 | 8.5% |
| а | 12 | 7.3% |
| Щ | 12 | 7.3% |
| Ђ | 9 | 5.5% |
| Ш | 8 | 4.8% |
| ў | 5 | 3.0% |
| Ќ | 5 | 3.0% |
| Other values (20) | 48 |
None
| Value | Count | Frequency (%) |
| © | 9 | |
| ¤ | 7 | |
| (control character) | 3 | 7.3% |
| § | 3 | 7.3% |
| (non-ASCII space) | 3 | 7.3% |
| ¬ | 3 | 7.3% |
| ® | 2 | 4.9% |
| « | 2 | 4.9% |
| (format character) | 2 | 4.9% |
| ± | 2 | 4.9% |
| Other values (4) | 5 |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 8 | |
| „ | 3 | 11.5% |
| – | 3 | 11.5% |
| • | 3 | 11.5% |
| † | 3 | 11.5% |
| ’ | 2 | 7.7% |
| ‚ | 1 | 3.8% |
| ‡ | 1 | 3.8% |
| … | 1 | 3.8% |
| “ | 1 | 3.8% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 5 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 4 | |
| № | 1 | 20.0% |
Address
Text
| Distinct | 6282 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 3192 |
| Missing (%) | 0.3% |
| Memory size | 8.0 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 15 |
| Mean length | 15.649854 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16360091 |
|---|---|
| Distinct characters | 158 |
| Distinct categories | 16 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 5610 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Andhra Pradesh |
|---|---|
| 2nd row | Andhra Pradesh |
| 3rd row | Andhra Pradesh |
| 4th row | Andhra Pradesh |
| 5th row | Andhra Pradesh in |
| Value | Count | Frequency (%) |
| andhra | 998753 | |
| pradesh | 998753 | |
| in | 490053 | |
| local | 9411 | 0.4% |
| ahmedabad | 9410 | 0.4% |
| hyderabad | 6760 | 0.3% |
| visakhapatnam | 799 | < 0.1% |
| vijayawada | 544 | < 0.1% |
| guntur | 458 | < 0.1% |
| vizag | 448 | < 0.1% |
| Other values (5871) | 18904 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2071118 | |
| (space) | 2044440 | |
| d | 2036339 | |
| r | 2014374 | |
| h | 2012117 | |
| n | 1498562 | |
| e | 1021645 | |
| A | 1009549 | |
| s | 1001767 | |
| P | 999424 | |
| Other values (148) | 650756 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12269097 | |
| Space Separator | 2044445 | 12.5% |
| Uppercase Letter | 2041322 | 12.5% |
| Decimal Number | 3524 | < 0.1% |
| Other Punctuation | 924 | < 0.1% |
| Dash Punctuation | 309 | < 0.1% |
| Open Punctuation | 146 | < 0.1% |
| Close Punctuation | 128 | < 0.1% |
| Math Symbol | 45 | < 0.1% |
| Other Symbol | 39 | < 0.1% |
| Other values (6) | 112 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1009549 | |
| P | 999424 | |
| L | 9741 | 0.5% |
| H | 7381 | 0.4% |
| V | 2286 | 0.1% |
| K | 1637 | 0.1% |
| N | 1435 | 0.1% |
| S | 1151 | 0.1% |
| B | 1063 | 0.1% |
| M | 1015 | < 0.1% |
| Other values (39) | 6640 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2071118 | |
| d | 2036339 | |
| r | 2014374 | |
| h | 2012117 | |
| n | 1498562 | |
| e | 1021645 | |
| s | 1001767 | |
| i | 499196 | 4.1% |
| b | 17964 | 0.1% |
| l | 16856 | 0.1% |
| Other values (33) | 79159 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 613 | |
| / | 155 | 16.8% |
| : | 46 | 5.0% |
| * | 19 | 2.1% |
| @ | 15 | 1.6% |
| # | 13 | 1.4% |
| ' | 10 | 1.1% |
| & | 9 | 1.0% |
| § | 9 | 1.0% |
| • | 7 | 0.8% |
| Other values (11) | 28 | 3.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 837 | |
| 1 | 536 | |
| 5 | 512 | |
| 2 | 404 | |
| 3 | 317 | 9.0% |
| 4 | 231 | 6.6% |
| 6 | 197 | 5.6% |
| 8 | 186 | 5.3% |
| 7 | 170 | 4.8% |
| 9 | 134 | 3.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 128 | |
| ‚ | 13 | 8.9% |
| „ | 3 | 2.1% |
| { | 1 | 0.7% |
| [ | 1 | 0.7% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 32 | |
| ¬ | 7 | 15.6% |
| + | 3 | 6.7% |
| = | 2 | 4.4% |
| ~ | 1 | 2.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 18 | |
| № | 12 | |
| ™ | 4 | 10.3% |
| ® | 4 | 10.3% |
| © | 1 | 2.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 | |
| ‘ | 4 | |
| « | 3 | |
| ‹ | 1 | 7.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 126 | |
| ] | 1 | 0.8% |
| } | 1 | 0.8% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 24 | |
| ” | 5 | 16.1% |
| › | 2 | 6.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 21 | |
| € | 5 | 17.2% |
| $ | 3 | 10.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2044440 | |
| (non-ASCII space) | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 308 | |
| – | 1 | 0.3% |
Control
| Value | Count | Frequency (%) |
| (control character) | 23 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 8 |
Format
| Value | Count | Frequency (%) |
| (format character) | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14309985 | |
| Common | 2049672 | 12.5% |
| Cyrillic | 434 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 2044440 | |
| 0 | 837 | < 0.1% |
| . | 613 | < 0.1% |
| 1 | 536 | < 0.1% |
| 5 | 512 | < 0.1% |
| 2 | 404 | < 0.1% |
| 3 | 317 | < 0.1% |
| - | 308 | < 0.1% |
| 4 | 231 | < 0.1% |
| 6 | 197 | < 0.1% |
| Other values (56) | 1277 | 0.1% |
Latin
| Value | Count | Frequency (%) |
| a | 2071118 | |
| d | 2036339 | |
| r | 2014374 | |
| h | 2012117 | |
| n | 1498562 | |
| e | 1021645 | |
| A | 1009549 | |
| s | 1001767 | |
| P | 999424 | |
| i | 499196 | 3.5% |
| Other values (42) | 145894 | 1.0% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 47 | 10.8% |
| р | 45 | 10.4% |
| О | 36 | 8.3% |
| а | 33 | 7.6% |
| Г | 31 | 7.1% |
| в | 26 | 6.0% |
| ё | 19 | 4.4% |
| Ш | 18 | 4.1% |
| С | 18 | 4.1% |
| Џ | 14 | 3.2% |
| Other values (30) | 147 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16359426 | |
| Cyrillic | 434 | < 0.1% |
| None | 135 | < 0.1% |
| Punctuation | 75 | < 0.1% |
| Letterlike Symbols | 16 | < 0.1% |
| Currency Symbols | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2071118 | |
| (space) | 2044440 | |
| d | 2036339 | |
| r | 2014374 | |
| h | 2012117 | |
| n | 1498562 | |
| e | 1021645 | |
| A | 1009549 | |
| s | 1001767 | |
| P | 999424 | |
| Other values (78) | 650091 | 4.0% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 47 | 10.8% |
| р | 45 | 10.4% |
| О | 36 | 8.3% |
| а | 33 | 7.6% |
| Г | 31 | 7.1% |
| в | 26 | 6.0% |
| ё | 19 | 4.4% |
| Ш | 18 | 4.1% |
| С | 18 | 4.1% |
| Џ | 14 | 3.2% |
| Other values (30) | 147 |
None
| Value | Count | Frequency (%) |
| ± | 32 | |
| (control character) | 23 | |
| ¤ | 21 | |
| ¦ | 18 | |
| § | 9 | 6.7% |
| (format character) | 7 | 5.2% |
| ¬ | 7 | 5.2% |
| (non-ASCII space) | 5 | 3.7% |
| ¶ | 4 | 3.0% |
| ® | 4 | 3.0% |
| Other values (3) | 5 | 3.7% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 24 | |
| ‚ | 13 | |
| • | 7 | 9.3% |
| “ | 6 | 8.0% |
| ” | 5 | 6.7% |
| ‘ | 4 | 5.3% |
| ‰ | 3 | 4.0% |
| ‡ | 3 | 4.0% |
| „ | 3 | 4.0% |
| † | 2 | 2.7% |
| Other values (4) | 5 | 6.7% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| № | 12 | |
| ™ | 4 | 25.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 5 |
JobTitle
Text
MISSING 
| Distinct | 6469 |
|---|---|
| Distinct (%) | 20.5% |
| Missing | 1017036 |
| Missing (%) | 97.0% |
| Memory size | 8.0 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 79 |
| Mean length | 9.5300739 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300569 |
|---|---|
| Distinct characters | 166 |
| Distinct categories | 17 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 5261 |
|---|---|
| Unique (%) | 16.7% |
Sample
| 1st row | Andhra Pradesh proddatur |
|---|---|
| 2nd row | India New Delhi |
| 3rd row | 500052 |
| 4th row | in |
| 5th row | Andhra Pradesh |
| Value | Count | Frequency (%) |
| in | 9922 | |
| gujarat | 9411 | |
| pradesh | 4313 | 9.4% |
| andhra | 4298 | 9.3% |
| india | 1790 | 3.9% |
| hyderabad | 1428 | 3.1% |
| vijayawada | 310 | 0.7% |
| warangal | 186 | 0.4% |
| karimnagar | 169 | 0.4% |
| bangalore | 167 | 0.4% |
| Other values (5772) | 13991 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 43768 | |
| a | 43685 | |
| r | 24928 | 8.3% |
| n | 20806 | 6.9% |
| i | 16604 | 5.5% |
| d | 15989 | 5.3% |
| t | 12952 | 4.3% |
| u | 11901 | 4.0% |
| e | 10987 | 3.7% |
| h | 10822 | 3.6% |
| Other values (156) | 88127 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 208363 | |
| Space Separator | 43775 | 14.6% |
| Uppercase Letter | 32192 | 10.7% |
| Decimal Number | 14845 | 4.9% |
| Other Punctuation | 843 | 0.3% |
| Open Punctuation | 83 | < 0.1% |
| Close Punctuation | 72 | < 0.1% |
| Dash Punctuation | 70 | < 0.1% |
| Control | 66 | < 0.1% |
| Currency Symbol | 56 | < 0.1% |
| Other values (7) | 204 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 9735 | |
| A | 5265 | |
| P | 4748 | |
| I | 2123 | 6.6% |
| H | 1801 | 5.6% |
| S | 838 | 2.6% |
| M | 743 | 2.3% |
| N | 658 | 2.0% |
| C | 602 | 1.9% |
| T | 591 | 1.8% |
| Other values (44) | 5088 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 43685 | |
| r | 24928 | |
| n | 20806 | |
| i | 16604 | 8.0% |
| d | 15989 | 7.7% |
| t | 12952 | 6.2% |
| u | 11901 | 5.7% |
| e | 10987 | 5.3% |
| h | 10822 | 5.2% |
| j | 10000 | 4.8% |
| Other values (37) | 29689 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 577 | |
| @ | 47 | 5.6% |
| ' | 37 | 4.4% |
| & | 35 | 4.2% |
| * | 28 | 3.3% |
| / | 24 | 2.8% |
| # | 19 | 2.3% |
| ? | 15 | 1.8% |
| • | 13 | 1.5% |
| ! | 11 | 1.3% |
| Other values (10) | 37 | 4.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4147 | |
| 5 | 2978 | |
| 1 | 1792 | |
| 2 | 1642 | 11.1% |
| 3 | 1371 | 9.2% |
| 4 | 829 | 5.6% |
| 6 | 645 | 4.3% |
| 7 | 584 | 3.9% |
| 8 | 506 | 3.4% |
| 9 | 351 | 2.4% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 21 | |
| ¦ | 9 | |
| ® | 6 | 13.6% |
| © | 5 | 11.4% |
| № | 3 | 6.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 70 | |
| „ | 7 | 8.4% |
| ‚ | 5 | 6.0% |
| { | 1 | 1.2% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 21 | |
| ” | 15 | |
| › | 6 | 13.6% |
| » | 2 | 4.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 20 | |
| ‘ | 18 | |
| « | 9 | |
| ‹ | 7 | 13.0% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 16 | |
| ¬ | 2 | 10.0% |
| ~ | 1 | 5.0% |
| + | 1 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 58 | |
| – | 7 | 10.0% |
| — | 5 | 7.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 36 | |
| $ | 19 | |
| € | 1 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 43768 | |
| (non-ASCII space) | 7 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 71 | |
| } | 1 | 1.4% |
Control
| Value | Count | Frequency (%) |
| (control character) | 66 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 37 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 4 |
Format
| Value | Count | Frequency (%) |
| (format character) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 239686 | |
| Common | 60015 | 20.0% |
| Cyrillic | 868 | 0.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 43768 | |
| 0 | 4147 | 6.9% |
| 5 | 2978 | 5.0% |
| 1 | 1792 | 3.0% |
| 2 | 1642 | 2.7% |
| 3 | 1371 | 2.3% |
| 4 | 829 | 1.4% |
| 6 | 645 | 1.1% |
| 7 | 584 | 1.0% |
| . | 577 | 1.0% |
| Other values (56) | 1682 | 2.8% |
Latin
| Value | Count | Frequency (%) |
| a | 43685 | |
| r | 24928 | |
| n | 20806 | 8.7% |
| i | 16604 | 6.9% |
| d | 15989 | 6.7% |
| t | 12952 | 5.4% |
| u | 11901 | 5.0% |
| e | 10987 | 4.6% |
| h | 10822 | 4.5% |
| j | 10000 | 4.2% |
| Other values (42) | 61012 |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 151 | |
| р | 137 | |
| Г | 76 | 8.8% |
| в | 60 | 6.9% |
| ђ | 30 | 3.5% |
| Д | 25 | 2.9% |
| а | 23 | 2.6% |
| Џ | 22 | 2.5% |
| Љ | 21 | 2.4% |
| ё | 21 | 2.4% |
| Other values (38) | 302 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 299363 | |
| Cyrillic | 868 | 0.3% |
| None | 170 | 0.1% |
| Punctuation | 143 | < 0.1% |
| Letterlike Symbols | 24 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 43768 | 14.6% |
| a | 43685 | 14.6% |
| r | 24928 | 8.3% |
| n | 20806 | 7.0% |
| i | 16604 | 5.5% |
| d | 15989 | 5.3% |
| t | 12952 | 4.3% |
| u | 11901 | 4.0% |
| e | 10987 | 3.7% |
| h | 10822 | 3.6% |
| Other values (75) | 86921 | 29.0% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 151 | 17.4% |
| р | 137 | 15.8% |
| Г | 76 | 8.8% |
| в | 60 | 6.9% |
| ђ | 30 | 3.5% |
| Д | 25 | 2.9% |
| а | 23 | 2.6% |
| Џ | 22 | 2.5% |
| Љ | 21 | 2.4% |
| ё | 21 | 2.4% |
| Other values (38) | 302 | 34.8% |
None
| Value | Count | Frequency (%) |
| | 66 | 38.8% |
| ¤ | 36 | 21.2% |
| ± | 16 | 9.4% |
| « | 9 | 5.3% |
| ¦ | 9 | 5.3% |
| | 7 | 4.1% |
| ® | 6 | 3.5% |
| § | 6 | 3.5% |
| © | 5 | 2.9% |
| ¶ | 3 | 1.8% |
| Other values (5) | 7 | 4.1% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 21 | 87.5% |
| № | 3 | 12.5% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 21 | 14.7% |
| “ | 20 | 14.0% |
| ‘ | 18 | 12.6% |
| ” | 15 | 10.5% |
| • | 13 | 9.1% |
| † | 7 | 4.9% |
| „ | 7 | 4.9% |
| – | 7 | 4.9% |
| ‹ | 7 | 4.9% |
| › | 6 | 4.2% |
| Other values (5) | 22 | 15.4% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 | 100.0% |
CompanyName
Text
MISSING 
| Distinct | 6682 |
|---|---|
| Distinct (%) | 43.3% |
| Missing | 1033131 |
| Missing (%) | 98.5% |
| Memory size | 8.0 MiB |
Length
| Max length | 64 |
|---|---|
| Median length | 54 |
| Mean length | 9.3054908 |
| Min length | 1 |
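The length figures above can be recomputed directly from the raw column, which may be worthwhile here: with every length at least 1, a median of 54 would force a mean of at least 27, so the reported median looks inconsistent with the reported mean of about 9.3. The following is a minimal sketch, assuming the column is available as a plain Python list of strings with `None` for missing cells; the helper name `length_stats` and the sample values are illustrative only.

```python
from statistics import mean, median


def length_stats(values):
    """Return max/median/mean/min string length, skipping missing (None) cells."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


# Illustrative input: the five sample rows shown for this column plus one missing cell.
sample = ["kadapa", "India", "Hyderabad", "Tirupati", "Karnataka", None]
stats = length_stats(sample)
print(stats)
```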
Characters and Unicode
| Total characters | 143714 |
|---|---|
| Distinct characters | 169 |
| Distinct categories | 17 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 5747 |
|---|---|
| Unique (%) | 37.2% |
Sample
| 1st row | kadapa |
|---|---|
| 2nd row | India |
| 3rd row | Hyderabad |
| 4th row | Tirupati |
| 5th row | Karnataka |
| Value | Count | Frequency (%) |
| india | 2283 | 10.3% |
| in | 1892 | 8.6% |
| pradesh | 1241 | 5.6% |
| andhra | 1228 | 5.6% |
| hyderabad | 572 | 2.6% |
| ltd | 174 | 0.8% |
| student | 157 | 0.7% |
| | 135 | 0.6% |
| police | 110 | 0.5% |
| bank | 107 | 0.5% |
| Other values (6140) | 14198 | 64.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 16512 | 11.5% |
| (space) | 14749 | 10.3% |
| n | 10576 | 7.4% |
| i | 9852 | 6.9% |
| d | 8758 | 6.1% |
| r | 8496 | 5.9% |
| e | 7608 | 5.3% |
| h | 4651 | 3.2% |
| t | 4643 | 3.2% |
| s | 4399 | 3.1% |
| Other values (159) | 53470 | 37.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 101384 | 70.5% |
| Uppercase Letter | 24194 | 16.8% |
| Space Separator | 14754 | 10.3% |
| Decimal Number | 1904 | 1.3% |
| Other Punctuation | 1010 | 0.7% |
| Dash Punctuation | 75 | 0.1% |
| Open Punctuation | 73 | 0.1% |
| Control | 55 | < 0.1% |
| Other Symbol | 48 | < 0.1% |
| Close Punctuation | 42 | < 0.1% |
| Other values (7) | 175 | 0.1% |
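The category counts above follow Unicode general categories. As a rough sketch of how such a tally could be reproduced with the standard library, the snippet below counts `unicodedata.category` codes per character; the short codes map onto the report's labels (for example `Ll` is Lowercase Letter, `Lu` Uppercase Letter, `Zs` Space Separator, `Po` Other Punctuation). The function name `category_counts` and the sample strings are assumptions for illustration, not part of the report.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Tally Unicode general-category codes over all characters in a column."""
    counts = Counter()
    for value in values:
        if value is None:  # skip missing cells
            continue
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# Illustrative input only.
counts = category_counts(["Andhra Pradesh", "S.B.I. bank", None])
print(counts.most_common())
```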
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3156 | 13.0% |
| A | 2842 | 11.7% |
| P | 2343 | 9.7% |
| S | 1904 | 7.9% |
| H | 1324 | 5.5% |
| C | 1149 | 4.7% |
| T | 1082 | 4.5% |
| M | 1073 | 4.4% |
| R | 904 | 3.7% |
| N | 900 | 3.7% |
| Other values (44) | 7517 | 31.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 16512 | 16.3% |
| n | 10576 | 10.4% |
| i | 9852 | 9.7% |
| d | 8758 | 8.6% |
| r | 8496 | 8.4% |
| e | 7608 | 7.5% |
| h | 4651 | 4.6% |
| t | 4643 | 4.6% |
| s | 4399 | 4.3% |
| o | 4067 | 4.0% |
| Other values (35) | 21822 | 21.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 678 | 67.1% |
| & | 103 | 10.2% |
| * | 40 | 4.0% |
| ' | 40 | 4.0% |
| @ | 30 | 3.0% |
| / | 21 | 2.1% |
| ? | 21 | 2.1% |
| : | 13 | 1.3% |
| • | 11 | 1.1% |
| § | 9 | 0.9% |
| Other values (9) | 44 | 4.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 474 | 24.9% |
| 5 | 303 | 15.9% |
| 1 | 231 | 12.1% |
| 2 | 227 | 11.9% |
| 3 | 183 | 9.6% |
| 4 | 125 | 6.6% |
| 7 | 104 | 5.5% |
| 9 | 91 | 4.8% |
| 6 | 90 | 4.7% |
| 8 | 76 | 4.0% |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 14 | 40.0% |
| \| | 8 | 22.9% |
| = | 4 | 11.4% |
| ¬ | 4 | 11.4% |
| ~ | 2 | 5.7% |
| < | 1 | 2.9% |
| > | 1 | 2.9% |
| + | 1 | 2.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 38 | 52.1% |
| ‚ | 22 | 30.1% |
| „ | 8 | 11.0% |
| { | 3 | 4.1% |
| [ | 2 | 2.7% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 21 | 43.8% |
| № | 9 | 18.8% |
| ® | 7 | 14.6% |
| ¦ | 6 | 12.5% |
| © | 5 | 10.4% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 28 | 71.8% |
| ” | 9 | 23.1% |
| › | 1 | 2.6% |
| » | 1 | 2.6% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 18 | 60.0% |
| ‹ | 6 | 20.0% |
| “ | 4 | 13.3% |
| « | 2 | 6.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 56 | 74.7% |
| – | 13 | 17.3% |
| — | 6 | 8.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37 | 88.1% |
| } | 3 | 7.1% |
| ] | 2 | 4.8% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 28 | 70.0% |
| $ | 6 | 15.0% |
| € | 6 | 15.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 14749 | > 99.9% |
| | 5 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| | 55 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 20 | 100.0% |
Format
| Value | Count | Frequency (%) |
| | 10 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124806 | 86.8% |
| Common | 18136 | 12.6% |
| Cyrillic | 772 | 0.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 14749 | 81.3% |
| . | 678 | 3.7% |
| 0 | 474 | 2.6% |
| 5 | 303 | 1.7% |
| 1 | 231 | 1.3% |
| 2 | 227 | 1.3% |
| 3 | 183 | 1.0% |
| 4 | 125 | 0.7% |
| 7 | 104 | 0.6% |
| & | 103 | 0.6% |
| Other values (60) | 959 | 5.3% |
Latin
| Value | Count | Frequency (%) |
| a | 16512 | 13.2% |
| n | 10576 | 8.5% |
| i | 9852 | 7.9% |
| d | 8758 | 7.0% |
| r | 8496 | 6.8% |
| e | 7608 | 6.1% |
| h | 4651 | 3.7% |
| t | 4643 | 3.7% |
| s | 4399 | 3.5% |
| o | 4067 | 3.3% |
| Other values (42) | 45244 | 36.3% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 110 | 14.2% |
| р | 101 | 13.1% |
| в | 79 | 10.2% |
| Г | 63 | 8.2% |
| Џ | 33 | 4.3% |
| ё | 29 | 3.8% |
| п | 21 | 2.7% |
| С | 18 | 2.3% |
| а | 18 | 2.3% |
| О | 16 | 2.1% |
| Other values (37) | 284 | 36.8% |
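The Cyrillic characters that dominate this table (џ, р, Г, в) are the kind that appear when UTF-8 bytes are decoded as Windows-1251, so they may be mojibake rather than genuine Cyrillic text. That is only a guess from the characters present, not something the report states. Under that assumption, a minimal repair attempt could look like the sketch below; the function name `try_repair_cp1251_mojibake` is hypothetical, and values that do not round-trip are returned unchanged.

```python
def try_repair_cp1251_mojibake(text):
    """Undo a suspected 'UTF-8 bytes read as cp1251' mix-up, if it round-trips.

    Assumption: the damage came from decoding UTF-8 with Windows-1251.
    If re-encoding or decoding fails, the input is returned as-is.
    """
    try:
        return text.encode("cp1251").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


# "Г©" is what "é" looks like after the suspected mix-up.
print(try_repair_cp1251_mojibake("Г©"))
# Plain ASCII passes through untouched.
print(try_repair_cp1251_mojibake("Hyderabad"))
```

Genuine Cyrillic words generally fail the UTF-8 decode step and are left alone, so the helper tends to be conservative, but results should still be spot-checked against the raw file.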
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 142604 | 99.2% |
| Cyrillic | 772 | 0.5% |
| Punctuation | 153 | 0.1% |
| None | 149 | 0.1% |
| Letterlike Symbols | 30 | < 0.1% |
| Currency Symbols | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 16512 | 11.6% |
| (space) | 14749 | 10.3% |
| n | 10576 | 7.4% |
| i | 9852 | 6.9% |
| d | 8758 | 6.1% |
| r | 8496 | 6.0% |
| e | 7608 | 5.3% |
| h | 4651 | 3.3% |
| t | 4643 | 3.3% |
| s | 4399 | 3.1% |
| Other values (81) | 52360 | 36.7% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 110 | 14.2% |
| р | 101 | 13.1% |
| в | 79 | 10.2% |
| Г | 63 | 8.2% |
| Џ | 33 | 4.3% |
| ё | 29 | 3.8% |
| п | 21 | 2.7% |
| С | 18 | 2.3% |
| а | 18 | 2.3% |
| О | 16 | 2.1% |
| Other values (37) | 284 | 36.8% |
None
| Value | Count | Frequency (%) |
| | 55 | 36.9% |
| ¤ | 28 | 18.8% |
| ± | 14 | 9.4% |
| | 10 | 6.7% |
| § | 9 | 6.0% |
| ® | 7 | 4.7% |
| ¦ | 6 | 4.0% |
| © | 5 | 3.4% |
| | 5 | 3.4% |
| ¬ | 4 | 2.7% |
| Other values (3) | 6 | 4.0% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 28 | 18.3% |
| ‚ | 22 | 14.4% |
| ‘ | 18 | 11.8% |
| – | 13 | 8.5% |
| • | 11 | 7.2% |
| † | 9 | 5.9% |
| ” | 9 | 5.9% |
| „ | 8 | 5.2% |
| ‡ | 8 | 5.2% |
| ‹ | 6 | 3.9% |
| Other values (5) | 21 | 13.7% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 21 | 70.0% |
| № | 9 | 30.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 6 | 100.0% |
Email
Text
MISSING 
| Distinct | 258435 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 782691 |
| Missing (%) | 74.6% |
| Memory size | 8.0 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 45 |
| Mean length | 23.463988 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6238699 |
|---|---|
| Distinct characters | 117 |
| Distinct categories | 16 |
| Distinct scripts | 3 |
| Distinct blocks | 5 |
Unique
| Unique | 257384 |
|---|---|
| Unique (%) | 96.8% |
Sample
| 1st row | shekarchinnu749@gmail.com |
|---|---|
| 2nd row | subbaramireddy0@gmail.com |
| 3rd row | thativenkatesh2000@gmail.com |
| 4th row | manibhararth@gmail.com |
| 5th row | Andhra Pradesh |
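The fifth sample row holds a place name rather than an address, and the word table below lists further non-email tokens, so some cells in this column appear to hold values that belong elsewhere. One way to size the problem is to split values by whether they have a basic email shape. The sketch below uses a deliberately loose pattern (something, `@`, something, a dot, something) as an assumption; it is not a full address validator, and `split_by_email_shape` is an illustrative name.

```python
import re

# Loose shape check only: no whitespace, exactly one "@", and a dot in the domain part.
EMAIL_SHAPE = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")


def split_by_email_shape(values):
    """Return (email_like, other) lists, skipping missing (None) cells."""
    email_like, other = [], []
    for value in values:
        if value is None:
            continue
        target = email_like if EMAIL_SHAPE.fullmatch(value.strip()) else other
        target.append(value)
    return email_like, other


valid, invalid = split_by_email_shape(["abc@gmail.com", "Andhra Pradesh", "in", None])
print(valid, invalid)
```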
| Value | Count | Frequency (%) |
| in | 5423 | 2.0% |
| pradesh | 511 | 0.2% |
| andhra | 510 | 0.2% |
| hyderabad | 74 | < 0.1% |
| abc@gmail.com | 37 | < 0.1% |
| student | 35 | < 0.1% |
| india | 34 | < 0.1% |
| ltd | 27 | < 0.1% |
| | 22 | < 0.1% |
| bank | 21 | < 0.1% |
| Other values (258541) | 260538 | 97.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 848680 | 13.6% |
| m | 634170 | 10.2% |
| i | 483816 | 7.8% |
| l | 360307 | 5.8% |
| o | 341022 | 5.5% |
| . | 305817 | 4.9% |
| g | 300173 | 4.8% |
| c | 290561 | 4.7% |
| @ | 258469 | 4.1% |
| r | 228923 | 3.7% |
| Other values (107) | 2186761 | 35.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5093759 | 81.6% |
| Other Punctuation | 564339 | 9.0% |
| Decimal Number | 554687 | 8.9% |
| Uppercase Letter | 16044 | 0.3% |
| Space Separator | 7682 | 0.1% |
| Connector Punctuation | 1954 | < 0.1% |
| Dash Punctuation | 169 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Math Symbol | 11 | < 0.1% |
| Other values (6) | 29 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2081 | 13.0% |
| S | 1678 | 10.5% |
| R | 1211 | 7.5% |
| P | 1183 | 7.4% |
| M | 1174 | 7.3% |
| G | 1012 | 6.3% |
| N | 848 | 5.3% |
| K | 738 | 4.6% |
| I | 634 | 4.0% |
| C | 609 | 3.8% |
| Other values (28) | 4876 | 30.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 848680 | 16.7% |
| m | 634170 | 12.4% |
| i | 483816 | 9.5% |
| l | 360307 | 7.1% |
| o | 341022 | 6.7% |
| g | 300173 | 5.9% |
| c | 290561 | 5.7% |
| r | 228923 | 4.5% |
| n | 214377 | 4.2% |
| h | 196704 | 3.9% |
| Other values (27) | 1195026 | 23.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 305817 | 54.2% |
| @ | 258469 | 45.8% |
| & | 22 | < 0.1% |
| ! | 10 | < 0.1% |
| ' | 8 | < 0.1% |
| * | 5 | < 0.1% |
| : | 2 | < 0.1% |
| / | 2 | < 0.1% |
| ¶ | 1 | < 0.1% |
| § | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 83164 | 15.0% |
| 9 | 70252 | 12.7% |
| 2 | 64631 | 11.7% |
| 0 | 61262 | 11.0% |
| 3 | 54000 | 9.7% |
| 7 | 50558 | 9.1% |
| 4 | 46312 | 8.3% |
| 8 | 43612 | 7.9% |
| 5 | 40637 | 7.3% |
| 6 | 40259 | 7.3% |
Math Symbol
| Value | Count | Frequency (%) |
| ¬ | 4 | 36.4% |
| ± | 4 | 36.4% |
| + | 2 | 18.2% |
| \| | 1 | 9.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | 66.7% |
| ” | 2 | 22.2% |
| › | 1 | 11.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 168 | 99.4% |
| – | 1 | 0.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 4 | 80.0% |
| ¤ | 1 | 20.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 4 | 80.0% |
| ‘ | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7682 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1954 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | 100.0% |
Control
| Value | Count | Frequency (%) |
| | 5 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 3 | 100.0% |
Format
| Value | Count | Frequency (%) |
| | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5109713 | 81.9% |
| Common | 1128896 | 18.1% |
| Cyrillic | 90 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 848680 | 16.6% |
| m | 634170 | 12.4% |
| i | 483816 | 9.5% |
| l | 360307 | 7.1% |
| o | 341022 | 6.7% |
| g | 300173 | 5.9% |
| c | 290561 | 5.7% |
| r | 228923 | 4.5% |
| n | 214377 | 4.2% |
| h | 196704 | 3.8% |
| Other values (42) | 1210980 | 23.7% |
Common
| Value | Count | Frequency (%) |
| . | 305817 | 27.1% |
| @ | 258469 | 22.9% |
| 1 | 83164 | 7.4% |
| 9 | 70252 | 6.2% |
| 2 | 64631 | 5.7% |
| 0 | 61262 | 5.4% |
| 3 | 54000 | 4.8% |
| 7 | 50558 | 4.5% |
| 4 | 46312 | 4.1% |
| 8 | 43612 | 3.9% |
| Other values (32) | 90819 | 8.0% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 18 | 20.0% |
| р | 15 | 16.7% |
| Д | 10 | 11.1% |
| Г | 5 | 5.6% |
| Ќ | 5 | 5.6% |
| Џ | 4 | 4.4% |
| Е | 4 | 4.4% |
| Ђ | 3 | 3.3% |
| а | 3 | 3.3% |
| О | 3 | 3.3% |
| Other values (13) | 20 | 22.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6238572 | > 99.9% |
| Cyrillic | 90 | < 0.1% |
| None | 22 | < 0.1% |
| Punctuation | 12 | < 0.1% |
| Letterlike Symbols | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 848680 | 13.6% |
| m | 634170 | 10.2% |
| i | 483816 | 7.8% |
| l | 360307 | 5.8% |
| o | 341022 | 5.5% |
| . | 305817 | 4.9% |
| g | 300173 | 4.8% |
| c | 290561 | 4.7% |
| @ | 258469 | 4.1% |
| r | 228923 | 3.7% |
| Other values (69) | 2186634 | 35.0% |
Cyrillic
| Value | Count | Frequency (%) |
| џ | 18 | 20.0% |
| р | 15 | 16.7% |
| Д | 10 | 11.1% |
| Г | 5 | 5.6% |
| Ќ | 5 | 5.6% |
| Џ | 4 | 4.4% |
| Е | 4 | 4.4% |
| Ђ | 3 | 3.3% |
| а | 3 | 3.3% |
| О | 3 | 3.3% |
| Other values (13) | 20 | 22.2% |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 6 | 50.0% |
| ” | 2 | 16.7% |
| ‘ | 1 | 8.3% |
| › | 1 | 8.3% |
| – | 1 | 8.3% |
| • | 1 | 8.3% |
None
| Value | Count | Frequency (%) |
| | 5 | 22.7% |
| ¬ | 4 | 18.2% |
| ± | 4 | 18.2% |
| « | 4 | 18.2% |
| | 2 | 9.1% |
| ¤ | 1 | 4.5% |
| ¶ | 1 | 4.5% |
| § | 1 | 4.5% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 3 | 100.0% |
Facebook
Text
MISSING 
| Distinct | 16176 |
|---|---|
| Distinct (%) | 83.5% |
| Missing | 1029196 |
| Missing (%) | 98.2% |
| Memory size | 8.0 MiB |
Length
| Max length | 173 |
|---|---|
| Median length | 11 |
| Mean length | 11.663192 |
| Min length | 1 |
Characters and Unicode
| Total characters | 226021 |
|---|---|
| Distinct characters | 126 |
| Distinct categories | 15 |
| Distinct scripts | 3 |
| Distinct blocks | 6 |
Unique
| Unique | 15896 |
|---|---|
| Unique (%) | 82.0% |
Sample
| 1st row | 1.54118E+15 |
|---|---|
| 2nd row | 5.27008E+14 |
| 3rd row | Tours&travels |
| 4th row | 1.00003E+14 |
| 5th row | 5.42543E+14 |
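Most sample values here are long numeric identifiers that have been collapsed into scientific notation (for example `1.54118E+15`), which discards the trailing digits. The lost digits cannot be recovered from this column, but affected cells can at least be flagged. The sketch below assumes a simple mantissa-and-exponent shape; `looks_like_sci_notation` is an illustrative helper, not part of any profiling library.

```python
import re

# One leading digit, optional fractional part, then an exponent such as E+15 or e+14.
SCI_NOTATION = re.compile(r"\d(?:\.\d+)?[eE][+-]?\d+")


def looks_like_sci_notation(value):
    """True if the cell looks like a number rendered in scientific notation."""
    return SCI_NOTATION.fullmatch(value.strip()) is not None


for cell in ["1.54118E+15", "1e+14", "Tours&travels", "100001234567890"]:
    print(cell, looks_like_sci_notation(cell))
```

If the original export is still available, re-reading the identifier columns as text rather than as numbers would avoid the truncation in the first place.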
| Value | Count | Frequency (%) |
| in | 645 | 3.2% |
| 1.00002e+14 | 440 | 2.2% |
| 1.00001e+14 | 432 | 2.1% |
| 1.00003e+14 | 368 | 1.8% |
| 1.00004e+14 | 251 | 1.2% |
| 1e+14 | 181 | 0.9% |
| 1.00005e+14 | 174 | 0.9% |
| 1.00006e+14 | 140 | 0.7% |
| 1.00007e+14 | 111 | 0.5% |
| 1.00008e+14 | 90 | 0.4% |
| Other values (16459) | 17488 | 86.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 32225 | 14.3% |
| 4 | 21267 | 9.4% |
| . | 17253 | 7.6% |
| E | 15572 | 6.9% |
| + | 15416 | 6.8% |
| 0 | 14688 | 6.5% |
| 2 | 11732 | 5.2% |
| 5 | 10906 | 4.8% |
| 3 | 8685 | 3.8% |
| 6 | 8343 | 3.7% |
| Other values (116) | 69934 | 30.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 130437 | 57.7% |
| Lowercase Letter | 41131 | 18.2% |
| Other Punctuation | 18803 | 8.3% |
| Uppercase Letter | 18480 | 8.2% |
| Math Symbol | 15419 | 6.8% |
| Space Separator | 1678 | 0.7% |
| Open Punctuation | 15 | < 0.1% |
| Connector Punctuation | 12 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Other Symbol | 9 | < 0.1% |
| Other values (5) | 27 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 15572 | 84.3% |
| S | 303 | 1.6% |
| A | 254 | 1.4% |
| I | 193 | 1.0% |
| C | 189 | 1.0% |
| P | 182 | 1.0% |
| R | 168 | 0.9% |
| M | 158 | 0.9% |
| T | 154 | 0.8% |
| L | 151 | 0.8% |
| Other values (32) | 1156 | 6.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5961 | 14.5% |
| i | 4179 | 10.2% |
| m | 3920 | 9.5% |
| o | 2750 | 6.7% |
| n | 2613 | 6.4% |
| l | 2551 | 6.2% |
| r | 2067 | 5.0% |
| c | 2064 | 5.0% |
| e | 1990 | 4.8% |
| g | 1967 | 4.8% |
| Other values (29) | 11069 | 26.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 17253 | 91.8% |
| @ | 1457 | 7.7% |
| / | 48 | 0.3% |
| & | 16 | 0.1% |
| : | 10 | 0.1% |
| ' | 6 | < 0.1% |
| § | 5 | < 0.1% |
| * | 2 | < 0.1% |
| † | 2 | < 0.1% |
| • | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 32225 | 24.7% |
| 4 | 21267 | 16.3% |
| 0 | 14688 | 11.3% |
| 2 | 11732 | 9.0% |
| 5 | 10906 | 8.4% |
| 3 | 8685 | 6.7% |
| 6 | 8343 | 6.4% |
| 7 | 7645 | 5.9% |
| 9 | 7520 | 5.8% |
| 8 | 7426 | 5.7% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 15416 | > 99.9% |
| ± | 2 | < 0.1% |
| = | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 | 66.7% |
| ‚ | 4 | 26.7% |
| „ | 1 | 6.7% |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 6 | 66.7% |
| ™ | 2 | 22.2% |
| ® | 1 | 11.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 | 60.0% |
| € | 1 | 20.0% |
| ¤ | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1677 | 99.9% |
| | 1 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7 | 87.5% |
| – | 1 | 12.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 5 | 71.4% |
| ‹ | 2 | 28.6% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1 | 50.0% |
| ” | 1 | 50.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 | 100.0% |
Control
| Value | Count | Frequency (%) |
| | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 166410 | 73.6% |
| Latin | 59500 | 26.3% |
| Cyrillic | 111 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 15572 | 26.2% |
| a | 5961 | 10.0% |
| i | 4179 | 7.0% |
| m | 3920 | 6.6% |
| o | 2750 | 4.6% |
| n | 2613 | 4.4% |
| l | 2551 | 4.3% |
| r | 2067 | 3.5% |
| c | 2064 | 3.5% |
| e | 1990 | 3.3% |
| Other values (42) | 15833 | 26.6% |
Common
| Value | Count | Frequency (%) |
| 1 | 32225 | 19.4% |
| 4 | 21267 | 12.8% |
| . | 17253 | 10.4% |
| + | 15416 | 9.3% |
| 0 | 14688 | 8.8% |
| 2 | 11732 | 7.1% |
| 5 | 10906 | 6.6% |
| 3 | 8685 | 5.2% |
| 6 | 8343 | 5.0% |
| 7 | 7645 | 4.6% |
| Other values (35) | 18250 | 11.0% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 18 | 16.2% |
| џ | 13 | 11.7% |
| Е | 10 | 9.0% |
| р | 10 | 9.0% |
| а | 6 | 5.4% |
| в | 5 | 4.5% |
| Љ | 4 | 3.6% |
| Ш | 4 | 3.6% |
| Щ | 3 | 2.7% |
| Ї | 3 | 2.7% |
| Other values (19) | 35 | 31.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 225866 | 99.9% |
| Cyrillic | 111 | < 0.1% |
| None | 21 | < 0.1% |
| Punctuation | 20 | < 0.1% |
| Letterlike Symbols | 2 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 32225 | 14.3% |
| 4 | 21267 | 9.4% |
| . | 17253 | 7.6% |
| E | 15572 | 6.9% |
| + | 15416 | 6.8% |
| 0 | 14688 | 6.5% |
| 2 | 11732 | 5.2% |
| 5 | 10906 | 4.8% |
| 3 | 8685 | 3.8% |
| 6 | 8343 | 3.7% |
| Other values (68) | 69779 | 30.9% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 18 | 16.2% |
| џ | 13 | 11.7% |
| Е | 10 | 9.0% |
| р | 10 | 9.0% |
| а | 6 | 5.4% |
| в | 5 | 4.5% |
| Љ | 4 | 3.6% |
| Ш | 4 | 3.6% |
| Щ | 3 | 2.7% |
| Ї | 3 | 2.7% |
| Other values (19) | 35 | 31.5% |
None
| Value | Count | Frequency (%) |
| ¦ | 6 | 28.6% |
| § | 5 | 23.8% |
| | 5 | 23.8% |
| ± | 2 | 9.5% |
| ® | 1 | 4.8% |
| | 1 | 4.8% |
| ¤ | 1 | 4.8% |
Punctuation
| Value | Count | Frequency (%) |
| ‘ | 5 | 25.0% |
| ‚ | 4 | 20.0% |
| ‹ | 2 | 10.0% |
| † | 2 | 10.0% |
| • | 2 | 10.0% |
| ’ | 1 | 5.0% |
| „ | 1 | 5.0% |
| – | 1 | 5.0% |
| ” | 1 | 5.0% |
| ‡ | 1 | 5.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 2 | 100.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 | 100.0% |
Twitter
Text
MISSING 
| Distinct | 6731 |
|---|---|
| Distinct (%) | 85.6% |
| Missing | 1040714 |
| Missing (%) | 99.3% |
| Memory size | 8.0 MiB |
Length
| Max length | 57 |
|---|---|
| Median length | 43 |
| Mean length | 18.335581 |
| Min length | 1 |
Characters and Unicode
| Total characters | 144136 |
|---|---|
| Distinct characters | 82 |
| Distinct categories | 12 |
| Distinct scripts | 3 |
| Distinct blocks | 3 |
Unique
| Unique | 6691 |
|---|---|
| Unique (%) | 85.1% |
Sample
| 1st row | mdsaifuddin5732@gmail.com |
|---|---|
| 2nd row | 1.00001E+14 |
| 3rd row | j.ravikumar2217@gmail.com |
| 4th row | devarajag93@gmail.com |
| 5th row | maheshpushpa1992@gmail.com |
| Value | Count | Frequency (%) |
| 1.00002e+14 | 224 | 2.8% |
| 1.00001e+14 | 207 | 2.6% |
| 1.00003e+14 | 171 | 2.1% |
| 1.00004e+14 | 118 | 1.5% |
| in | 80 | 1.0% |
| 1.00007e+14 | 70 | 0.9% |
| 1.00005e+14 | 68 | 0.8% |
| 1e+14 | 59 | 0.7% |
| 1.00006e+14 | 55 | 0.7% |
| 1.00008e+14 | 47 | 0.6% |
| Other values (6837) | 6969 | 86.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14733 | 10.2% |
| m | 11004 | 7.6% |
| i | 8414 | 5.8% |
| . | 8383 | 5.8% |
| 1 | 7249 | 5.0% |
| o | 6442 | 4.5% |
| l | 6196 | 4.3% |
| 0 | 5934 | 4.1% |
| c | 5295 | 3.7% |
| g | 5132 | 3.6% |
| Other values (72) | 65354 | 45.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 91503 | 63.5% |
| Decimal Number | 32943 | 22.9% |
| Other Punctuation | 12939 | 9.0% |
| Uppercase Letter | 3555 | 2.5% |
| Math Symbol | 2765 | 1.9% |
| Space Separator | 323 | 0.2% |
| Connector Punctuation | 94 | 0.1% |
| Dash Punctuation | 6 | < 0.1% |
| Currency Symbol | 3 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14733 | 16.1% |
| m | 11004 | 12.0% |
| i | 8414 | 9.2% |
| o | 6442 | 7.0% |
| l | 6196 | 6.8% |
| c | 5295 | 5.8% |
| g | 5132 | 5.6% |
| r | 4317 | 4.7% |
| n | 3962 | 4.3% |
| h | 3759 | 4.1% |
| Other values (17) | 22249 | 24.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2797 | 78.7% |
| S | 92 | 2.6% |
| A | 85 | 2.4% |
| R | 57 | 1.6% |
| P | 53 | 1.5% |
| M | 50 | 1.4% |
| C | 40 | 1.1% |
| I | 38 | 1.1% |
| T | 36 | 1.0% |
| N | 35 | 1.0% |
| Other values (17) | 272 | 7.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7249 | 22.0% |
| 0 | 5934 | 18.0% |
| 4 | 4088 | 12.4% |
| 5 | 2674 | 8.1% |
| 2 | 2511 | 7.6% |
| 9 | 2324 | 7.1% |
| 3 | 2169 | 6.6% |
| 7 | 2039 | 6.2% |
| 8 | 1994 | 6.1% |
| 6 | 1961 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8383 | 64.8% |
| @ | 4517 | 34.9% |
| / | 25 | 0.2% |
| : | 8 | 0.1% |
| & | 3 | < 0.1% |
| ' | 2 | < 0.1% |
| # | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | 33.3% |
| ‚ | 1 | 33.3% |
| „ | 1 | 33.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2764 | > 99.9% |
| = | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 323 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 94 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‘ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 95051 | 65.9% |
| Common | 49078 | 34.0% |
| Cyrillic | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14733 | 15.5% |
| m | 11004 | 11.6% |
| i | 8414 | 8.9% |
| o | 6442 | 6.8% |
| l | 6196 | 6.5% |
| c | 5295 | 5.6% |
| g | 5132 | 5.4% |
| r | 4317 | 4.5% |
| n | 3962 | 4.2% |
| h | 3759 | 4.0% |
| Other values (41) | 25797 | 27.1% |
Common
| Value | Count | Frequency (%) |
| . | 8383 | 17.1% |
| 1 | 7249 | 14.8% |
| 0 | 5934 | 12.1% |
| @ | 4517 | 9.2% |
| 4 | 4088 | 8.3% |
| + | 2764 | 5.6% |
| 5 | 2674 | 5.4% |
| 2 | 2511 | 5.1% |
| 9 | 2324 | 4.7% |
| 3 | 2169 | 4.4% |
| Other values (18) | 6465 | 13.2% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 5 | 71.4% |
| ђ | 1 | 14.3% |
| Њ | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 144126 | > 99.9% |
| Cyrillic | 7 | < 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 14733 | 10.2% |
| m | 11004 | 7.6% |
| i | 8414 | 5.8% |
| . | 8383 | 5.8% |
| 1 | 7249 | 5.0% |
| o | 6442 | 4.5% |
| l | 6196 | 4.3% |
| 0 | 5934 | 4.1% |
| c | 5295 | 3.7% |
| g | 5132 | 3.6% |
| Other values (66) | 65344 | 45.3% |
Cyrillic
| Value | Count | Frequency (%) |
| Г | 5 | 71.4% |
| ђ | 1 | 14.3% |
| Њ | 1 | 14.3% |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 1 | 33.3% |
| ‘ | 1 | 33.3% |
| „ | 1 | 33.3% |
Unnamed: 10
Text
MISSING 
| Distinct | 124 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1035163 |
| Missing (%) | 98.7% |
| Memory size | 8.0 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 22 |
| Mean length | 2.4480316 |
| Min length | 1 |
Characters and Unicode
| Total characters | 32833 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 7 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 107 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | user |
|---|---|
| 2nd row | 0.9 |
| 3rd row | 0.9 |
| 4th row | 0 |
| 5th row | 0 |
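Values such as `user`, `0.9` and `0` recur across several unnamed columns, which would be consistent with some rows carrying more fields than the header names, for instance because of unquoted commas inside a field. That cause is an assumption; the report itself does not confirm it. A quick check on the raw file could be sketched as follows, using only the standard `csv` module; `rows_wider_than_header` and the sample text are illustrative.

```python
import csv
import io


def rows_wider_than_header(text):
    """Return (line_number, field_count) for rows with more fields than the header."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    return [
        (line_no, len(row))
        for line_no, row in enumerate(reader, start=2)  # header is line 1
        if len(row) > len(header)
    ]


# Illustrative data: the second record has an unquoted comma in the company name.
sample = "Name,CompanyName,Email\nA,Acme,a@example.com\nB,Acme, Ltd,b@example.com\n"
wide = rows_wider_than_header(sample)
print(wide)
```

Line numbers here count parsed records, so they match physical lines only when no quoted field spans multiple lines.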
| Value | Count | Frequency (%) |
| 0 | 6610 | 49.5% |
| 0.9 | 3883 | 29.1% |
| user | 2117 | 15.9% |
| verified | 551 | 4.1% |
| college | 20 | 0.1% |
| premium | 7 | 0.1% |
| university | 7 | 0.1% |
| construction | 4 | < 0.1% |
| mechanic | 3 | < 0.1% |
| service | 3 | < 0.1% |
| Other values (125) | 138 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 10634 | 32.4% |
| . | 3967 | 12.1% |
| 9 | 3920 | 11.9% |
| e | 3319 | 10.1% |
| r | 2718 | 8.3% |
| s | 2152 | 6.6% |
| u | 2136 | 6.5% |
| i | 1174 | 3.6% |
| v | 568 | 1.7% |
| d | 559 | 1.7% |
| Other values (47) | 1686 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15018 | 45.7% |
| Lowercase Letter | 13638 | 41.5% |
| Other Punctuation | 3973 | 12.1% |
| Space Separator | 109 | 0.3% |
| Uppercase Letter | 86 | 0.3% |
| Math Symbol | 7 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3319 | 24.3% |
| r | 2718 | 19.9% |
| s | 2152 | 15.8% |
| u | 2136 | 15.7% |
| i | 1174 | 8.6% |
| v | 568 | 4.2% |
| d | 559 | 4.1% |
| f | 555 | 4.1% |
| n | 71 | 0.5% |
| l | 70 | 0.5% |
| Other values (14) | 316 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 27 | 31.4% |
| E | 12 | 14.0% |
| U | 7 | 8.1% |
| M | 6 | 7.0% |
| F | 4 | 4.7% |
| T | 4 | 4.7% |
| H | 4 | 4.7% |
| I | 4 | 4.7% |
| L | 3 | 3.5% |
| P | 3 | 3.5% |
| Other values (6) | 12 | 14.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 10634 | 70.8% |
| 9 | 3920 | 26.1% |
| 3 | 106 | 0.7% |
| 1 | 75 | 0.5% |
| 4 | 54 | 0.4% |
| 6 | 52 | 0.3% |
| 5 | 52 | 0.3% |
| 2 | 45 | 0.3% |
| 8 | 41 | 0.3% |
| 7 | 39 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3967 | 99.8% |
| & | 3 | 0.1% |
| @ | 2 | 0.1% |
| / | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 109 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 7 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19109 | 58.2% |
| Latin | 13724 | 41.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3319 | 24.2% |
| r | 2718 | 19.8% |
| s | 2152 | 15.7% |
| u | 2136 | 15.6% |
| i | 1174 | 8.6% |
| v | 568 | 4.1% |
| d | 559 | 4.1% |
| f | 555 | 4.0% |
| n | 71 | 0.5% |
| l | 70 | 0.5% |
| Other values (30) | 402 | 2.9% |
Common
| Value | Count | Frequency (%) |
| 0 | 10634 | 55.6% |
| . | 3967 | 20.8% |
| 9 | 3920 | 20.5% |
| (space) | 109 | 0.6% |
| 3 | 106 | 0.6% |
| 1 | 75 | 0.4% |
| 4 | 54 | 0.3% |
| 6 | 52 | 0.3% |
| 5 | 52 | 0.3% |
| 2 | 45 | 0.2% |
| Other values (7) | 95 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32833 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 10634 | 32.4% |
| . | 3967 | 12.1% |
| 9 | 3920 | 11.9% |
| e | 3319 | 10.1% |
| r | 2718 | 8.3% |
| s | 2152 | 6.6% |
| u | 2136 | 6.5% |
| i | 1174 | 3.6% |
| v | 568 | 1.7% |
| d | 559 | 1.7% |
| Other values (47) | 1686 | 5.1% |
Unnamed: 11
Text
MISSING 
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 1041161 |
| Missing (%) | 99.3% |
| Memory size | 8.0 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 1 |
| Mean length | 2.2379282 |
| Min length | 1 |
Characters and Unicode
| Total characters | 16592 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 45 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 0.9 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0.9 |
| 5th row | 0 |
| Value | Count | Frequency (%) |
| 0 | 3948 | 53.6% |
| 0.9 | 2371 | 32.2% |
| user | 729 | 9.9% |
| verified | 229 | 3.1% |
| premium | 7 | 0.1% |
| services | 3 | < 0.1% |
| store | 3 | < 0.1% |
| andhra | 3 | < 0.1% |
| pradesh | 3 | < 0.1% |
| college | 3 | < 0.1% |
| Other values (57) | 65 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6380 | 38.5% |
| . | 2401 | 14.5% |
| 9 | 2385 | 14.4% |
| e | 1240 | 7.5% |
| r | 995 | 6.0% |
| s | 746 | 4.5% |
| u | 742 | 4.5% |
| i | 485 | 2.9% |
| v | 237 | 1.4% |
| d | 237 | 1.4% |
| Other values (37) | 744 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8956 | 54.0% |
| Lowercase Letter | 5107 | 30.8% |
| Other Punctuation | 2402 | 14.5% |
| Space Separator | 93 | 0.6% |
| Uppercase Letter | 32 | 0.2% |
| Math Symbol | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1240 | 24.3% |
| r | 995 | 19.5% |
| s | 746 | 14.6% |
| u | 742 | 14.5% |
| i | 485 | 9.5% |
| v | 237 | 4.6% |
| d | 237 | 4.6% |
| f | 229 | 4.5% |
| n | 25 | 0.5% |
| o | 24 | 0.5% |
| Other values (12) | 147 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6 | 18.8% |
| P | 5 | 15.6% |
| E | 4 | 12.5% |
| M | 4 | 12.5% |
| A | 3 | 9.4% |
| U | 2 | 6.2% |
| H | 2 | 6.2% |
| G | 2 | 6.2% |
| S | 2 | 6.2% |
| I | 1 | 3.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6380 | 71.2% |
| 9 | 2385 | 26.6% |
| 3 | 44 | 0.5% |
| 6 | 25 | 0.3% |
| 7 | 25 | 0.3% |
| 2 | 21 | 0.2% |
| 5 | 21 | 0.2% |
| 4 | 21 | 0.2% |
| 1 | 20 | 0.2% |
| 8 | 14 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2401 | > 99.9% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 93 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11453 | 69.0% |
| Latin | 5139 | 31.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1240 | 24.1% |
| r | 995 | 19.4% |
| s | 746 | 14.5% |
| u | 742 | 14.4% |
| i | 485 | 9.4% |
| v | 237 | 4.6% |
| d | 237 | 4.6% |
| f | 229 | 4.5% |
| n | 25 | 0.5% |
| o | 24 | 0.5% |
| Other values (23) | 179 | 3.5% |
Common
| Value | Count | Frequency (%) |
| 0 | 6380 | 55.7% |
| . | 2401 | 21.0% |
| 9 | 2385 | 20.8% |
| (space) | 93 | 0.8% |
| 3 | 44 | 0.4% |
| 6 | 25 | 0.2% |
| 7 | 25 | 0.2% |
| 2 | 21 | 0.2% |
| 5 | 21 | 0.2% |
| 4 | 21 | 0.2% |
| Other values (4) | 37 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16592 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6380 | 38.5% |
| . | 2401 | 14.5% |
| 9 | 2385 | 14.4% |
| e | 1240 | 7.5% |
| r | 995 | 6.0% |
| s | 746 | 4.5% |
| u | 742 | 4.5% |
| i | 485 | 2.9% |
| v | 237 | 1.4% |
| d | 237 | 1.4% |
| Other values (37) | 744 | 4.5% |
Unnamed: 12
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 1045043 |
| Missing (%) | 99.7% |
| Memory size | 8.0 MiB |
| 0 | 2396 |
|---|---|
| 0.9 | 757 |
| user | 255 |
| verified | 81 |
| | 22 |
| Other values (19) | 21 |
Length
| Max length | 22 |
|---|---|
| Median length | 1 |
| Mean length | 1.8544734 |
| Min length | 1 |
Characters and Unicode
| Total characters | 6550 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 8 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 18 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0.9 |
| 4th row | 0.9 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2396 | 0.2% |
| 0.9 | 757 | 0.1% |
| user | 255 | < 0.1% |
| verified | 81 | < 0.1% |
| | 22 | < 0.1% |
| premium | 3 | < 0.1% |
| 0.3117503 | 1 | < 0.1% |
| Consulting | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 0.5491378 | 1 | < 0.1% |
| Other values (14) | 14 | < 0.1% |
| (Missing) | 1045043 | 99.7% |
Length
| Value | Count | Frequency (%) |
| 0 | 2396 | 68.2% |
| 0.9 | 757 | 21.5% |
| user | 255 | 7.3% |
| verified | 81 | 2.3% |
| premium | 3 | 0.1% |
| and | 1 | < 0.1% |
| bank | 1 | < 0.1% |
| 0.30825663 | 1 | < 0.1% |
| 0.312846 | 1 | < 0.1% |
| technology | 1 | < 0.1% |
| Other values (16) | 16 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3166 | 48.3% |
| . | 765 | 11.7% |
| 9 | 759 | 11.6% |
| e | 424 | 6.5% |
| r | 345 | 5.3% |
| u | 259 | 4.0% |
| s | 259 | 4.0% |
| i | 175 | 2.7% |
| d | 84 | 1.3% |
| f | 83 | 1.3% |
| Other values (34) | 231 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3979 | 60.7% |
| Lowercase Letter | 1768 | 27.0% |
| Other Punctuation | 766 | 11.7% |
| Space Separator | 25 | 0.4% |
| Uppercase Letter | 9 | 0.1% |
| Math Symbol | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 424 | 24.0% |
| r | 345 | 19.5% |
| u | 259 | 14.6% |
| s | 259 | 14.6% |
| i | 175 | 9.9% |
| d | 84 | 4.8% |
| f | 83 | 4.7% |
| v | 82 | 4.6% |
| n | 15 | 0.8% |
| m | 8 | 0.5% |
| Other values (10) | 34 | 1.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3166 | 79.6% |
| 9 | 759 | 19.1% |
| 3 | 11 | 0.3% |
| 7 | 8 | 0.2% |
| 8 | 8 | 0.2% |
| 1 | 7 | 0.2% |
| 2 | 6 | 0.2% |
| 4 | 6 | 0.2% |
| 6 | 5 | 0.1% |
| 5 | 3 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 2 | 22.2% |
| U | 1 | 11.1% |
| T | 1 | 11.1% |
| N | 1 | 11.1% |
| E | 1 | 11.1% |
| C | 1 | 11.1% |
| I | 1 | 11.1% |
| B | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 765 | 99.9% |
| @ | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4773 | 72.9% |
| Latin | 1777 | 27.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 424 | 23.9% |
| r | 345 | 19.4% |
| u | 259 | 14.6% |
| s | 259 | 14.6% |
| i | 175 | 9.8% |
| d | 84 | 4.7% |
| f | 83 | 4.7% |
| v | 82 | 4.6% |
| n | 15 | 0.8% |
| m | 8 | 0.5% |
| Other values (18) | 43 | 2.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 3166 | 66.3% |
| . | 765 | 16.0% |
| 9 | 759 | 15.9% |
| (space) | 25 | 0.5% |
| 3 | 11 | 0.2% |
| 7 | 8 | 0.2% |
| 8 | 8 | 0.2% |
| 1 | 7 | 0.1% |
| 2 | 6 | 0.1% |
| 4 | 6 | 0.1% |
| Other values (6) | 12 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3166 | 48.3% |
| . | 765 | 11.7% |
| 9 | 759 | 11.6% |
| e | 424 | 6.5% |
| r | 345 | 5.3% |
| u | 259 | 4.0% |
| s | 259 | 4.0% |
| i | 175 | 2.7% |
| d | 84 | 1.3% |
| f | 83 | 1.3% |
| Other values (34) | 231 | 3.5% |
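The character and category tables above for Unnamed: 12 can be approximated with Python's standard library. The sketch below is only an illustration of the idea, not the profiler's own implementation: it tallies Unicode general categories with `unicodedata.category`. The helper name `category_counts`, the `CATEGORY_NAMES` mapping, and the four-value sample are assumptions made for this example.

```python
import unicodedata
from collections import Counter

# Readable names for the Unicode general categories reported for this column.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
    "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Hypothetical sample: one copy of each of the four most common values.
sample = ["0", "0.9", "user", "verified"]
counts = category_counts(sample)
total = sum(counts.values())
for name, count in counts.most_common():
    print(f"{name}: {count} ({count / total:.1%})")
```

Running the same function over every non-missing value of the column, rather than this four-value sample, would be the way to compare against the counts reported above.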
Unnamed: 13
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1047413 |
|---|---|
| Missing (%) | 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 14
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048162 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 15
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048457 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 16
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048531 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 17
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048555 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 18
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048561 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 19
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048564 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 20
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048568 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 21
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048569 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 22
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048571 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 23
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048571 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 24
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048571 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 25
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048572 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
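The missing-value percentages for the columns above follow from the missing count and the 1048575 observations in the dataset. A minimal sketch of that arithmetic is shown below. The function name `missing_pct_label` is hypothetical, and the rule for printing "> 99.9%" is an assumption inferred from the values in this report, not taken from the profiler's source.

```python
TOTAL_ROWS = 1048575  # number of observations from the dataset statistics


def missing_pct_label(missing, total=TOTAL_ROWS):
    """Format a missing-value share as a one-decimal percentage label."""
    rounded = round(100.0 * missing / total, 1)
    # Assumption: if rounding would print 100.0% while some values are
    # still present, show "> 99.9%" instead.
    if missing < total and rounded >= 100.0:
        return "> 99.9%"
    return f"{rounded:.1f}%"


for column, missing in [
    ("Unnamed: 12", 1045043),
    ("Unnamed: 13", 1047413),
    ("Unnamed: 14", 1048162),
]:
    print(column, missing_pct_label(missing))
```

Under this rule a column such as Unnamed: 14, with 413 non-missing values, is labelled "> 99.9%" because its share rounds up to 100.0%.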
Unnamed: 26
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1048574 |
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
| 0.9 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.9 |
|---|
Common Values
| Value | Count | Frequency (%) |
| 0.9 | 1 | < 0.1% |
| (Missing) | 1048574 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.9 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1 | |
| . | 1 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | |
| Other Punctuation | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 9 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1 | |
| . | 1 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1 | |
| . | 1 | |
| 9 | 1 |
Unnamed: 27
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1048574 |
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.0 |
|---|
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1 | < 0.1% |
| (Missing) | 1048574 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | |
| Other Punctuation | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
Unnamed: 28
Text
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1048574 |
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | user |
|---|
| Value | Count | Frequency (%) |
| user | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 1 | |
| s | 1 | |
| e | 1 | |
| r | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1 | |
| s | 1 | |
| e | 1 | |
| r | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 1 | |
| s | 1 | |
| e | 1 | |
| r | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 1 | |
| s | 1 | |
| e | 1 | |
| r | 1 |
Unnamed: 29
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048573 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 30
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1048572 |
|---|---|
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
Unnamed: 31
Categorical
HIGH CORRELATION  MISSING  UNIFORM 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1048573 |
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
| 0.9 | |
|---|---|
| 0.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 6 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.9 |
|---|---|
| 2nd row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.9 | 1 | < 0.1% |
| 0.0 | 1 | < 0.1% |
| (Missing) | 1048573 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.9 | 1 | |
| 0.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3 | 50.0% |
| . | 2 | 33.3% |
| 9 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | 66.7% |
| Other Punctuation | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | 75.0% |
| 9 | 1 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3 | 50.0% |
| . | 2 | 33.3% |
| 9 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3 | 50.0% |
| . | 2 | 33.3% |
| 9 | 1 | 16.7% |
Unnamed: 32
Categorical
CONSTANT  MISSING 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1048574 |
| Missing (%) | > 99.9% |
| Memory size | 8.0 MiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.0 |
|---|
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 1 | < 0.1% |
| (Missing) | 1048574 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | |
| Other Punctuation | 1 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| . | 1 |
| | Number | Unnamed: 12 | Unnamed: 31 |
|---|---|---|---|
| Number | 1.000 | 0.052 | 1.000 |
| Unnamed: 12 | 0.052 | 1.000 | 1.000 |
| Unnamed: 31 | 1.000 | 1.000 | 1.000 |
| | Number | Carrier | Name | Gender | Address | JobTitle | CompanyName | Email | Facebook | Twitter | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | Unnamed: 17 | Unnamed: 18 | Unnamed: 19 | Unnamed: 20 | Unnamed: 21 | Unnamed: 22 | Unnamed: 23 | Unnamed: 24 | Unnamed: 25 | Unnamed: 26 | Unnamed: 27 | Unnamed: 28 | Unnamed: 29 | Unnamed: 30 | Unnamed: 31 | Unnamed: 32 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 917032000911 | Airtel | Raj Kumar. Mncl | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | 917032001170 | Airtel | Deva | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | 917032001495 | Airtel | Lakshay | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | 917032001483 | Airtel | Prathap | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 917032001471 | Airtel | Shekar Chinnu | NaN | Andhra Pradesh in | NaN | NaN | shekarchinnu749@gmail.com | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | 917032001466 | Airtel | Subba Rami Reddy | NaN | Andhra Pradesh in | NaN | NaN | subbaramireddy0@gmail.com | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 917032001460 | Airtel | Thati Venkatesh | NaN | Andhra Pradesh in | NaN | NaN | thativenkatesh2000@gmail.com | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 917032001445 | Airtel | J S Anil | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 917032001438 | Airtel | Sunitha | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 917032001435 | Airtel | Vaiven Smiley | NaN | Andhra Pradesh in | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| | Number | Carrier | Name | Gender | Address | JobTitle | CompanyName | Email | Facebook | Twitter | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | Unnamed: 14 | Unnamed: 15 | Unnamed: 16 | Unnamed: 17 | Unnamed: 18 | Unnamed: 19 | Unnamed: 20 | Unnamed: 21 | Unnamed: 22 | Unnamed: 23 | Unnamed: 24 | Unnamed: 25 | Unnamed: 26 | Unnamed: 27 | Unnamed: 28 | Unnamed: 29 | Unnamed: 30 | Unnamed: 31 | Unnamed: 32 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1048565 | 917995485166 | Airtel | NaN | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048566 | 917995485162 | Airtel | Parameshwari Bzc | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048567 | 917995485160 | Airtel | Meshak Abraham | NaN | Andhra Pradesh in | NaN | NaN | abhiram6691@gmail.com | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048568 | 917995485157 | Airtel | Baby. Lx. | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048569 | 917995485155 | Airtel | Tailler | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048570 | 917995485149 | Airtel | Parameswari | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048571 | 917995485148 | Airtel | T.r.reddy4 | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048572 | 917995485142 | Airtel | N Sures | NaN | Andhra Pradesh | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048573 | 917995485139 | Airtel | P P | NaN | Andhra Pradesh in | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 1048574 | 917995485133 | Airtel | Nani Nani | NaN | Andhra Pradesh in | NaN | NaN | naninani1199552@gmail.com | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |